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ABSTRACT 

The X-ray properties of a sample of 11 high-redshift (0.6 < z < 1.0) clusters observed 
with Chandra and/or XMM-Newton are used to investigate the evolution of the cluster 
scaling relations. The observed evolution in the normalisation of the L — T, M — T, 
M g — T, and M — L relations are consistent with simple self-similar predictions, in 
which the properties of clusters reflect the properties of the universe at their redshift of 
observation. Under the assumption that the model of self-similar evolution is correct 
and that the local systems formed via a single spherical collapse, the high-redshift 
L — T relation is consistent with the high-z clusters having virialised at a significantly 
higher redshift than the local systems. The data are also consistent with the more 
realistic scenario of clusters forming via the continuous accretion of material. 

The slope of the L — T relation at high-redshift (B = 3.32 ± 0.37) is consistent 
with the local relation, and significantly steeper then the self-similar prediction of 
B = 2. This suggests that the same non-gravitational processes are responsible for 
steepening the local and high-z relations, possibly occurring universally at z > 1 or in 
the early stages of the clusters’ formation, prior to their observation. 

The properties of the intra-cluster medium at high-redshift are found to be similar 
to those in the local universe. The mean surface-brightness profile slope for the sample 
is (3 = 0.66 ±0.05, the mean gas mass fractions within i? 2500 (z) an d -^ 200 ( 2 ) are 0.069 ± 
0.012 and 0.11 ± 0.02 respectively, and the mean metallicity of the sample is 0.28 ± 
O.11Z 0 . 

Key words: cosmology: observations - galaxies: clusters: general - galaxies: high- 
redshift - intergalactic medium - X-rays: galaxies 


1 INTRODUCTION 

A simple and useful model to describe galaxy clusters is that 
they are self-similar. In this model, clusters form via the 
collapse of the most overdense regions in the early universe, 
and the cluster baryons are heated only by gravitational pro¬ 
cesses (compression and shock heating) during the collapse. 
The properties of clusters at high redshift are then identi¬ 
cal to those of their low-redshift counterparts, apart from 
scaling factors reflecting the increase of the mean density 
of the universe with redshift. This scaling with redshift has 
been termed weak self-similarity ( e.g. Bower 1997). I 11 the 
strongest form of the model, the mass profiles of all clusters 
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at the same epoch follow the same shape independent of 
their total mass. The self-similar model then allows proper¬ 
ties of clusters of different masses and at different redshifts 
to be related to one-another according to simple scaling laws. 

X-ray observations provide a powerful way of measuring 
cluster properties, and have provided a wealth of evidence 
that galaxy clusters do not scale self-similarly with mass (or, 
by proxy, temperature) in the local universe. For example; 
the slope of the X-ray luminosity-temperature (L — T) rela¬ 
tion is steeper than self-similar predictions (e.g. Markevitch 
1998; Arnaud & Evrard 1999); the slopes of the gas-density 
and surface-brightness profiles are shallower in cooler sys¬ 
tems (e.g. Lloyd-Davies et al. 2000; Sanderson et al. 2003); 
and the entropy in cluster cores is higher than predicted 
(e.g. Ponman et al. 1999, 2003). These departures from self¬ 
similarity are generally taken as evidence for the importance 
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of non-gravitational contributions (such as heating by AGN 
or radiative cooling) to the energy budget of clusters. 

In this paper we address the open question of whether 
the simple evolution of the cluster scaling relations pre¬ 
dicted by the self-similar model is obeyed. We define evo¬ 
lution as any change with respect to the local scaling rela¬ 
tions, and compare any such evolution with the self-similar 
predictions. The results of early studies of the evolution of 
the L — T relation were consistent with little or no evolu¬ 
tion ( e.g. Mushotzky & Scharf 1997; Donahue et al. 1999; 
Fairley et al. 2000). More recent studies with Chandra and 
XMM-Newton have found significant evolution in the L — T 
relation (Viklilinin et al. 2002; Ettori et al. 2004; Lumb et al. 
2004). This change is due to the availability of larger sam¬ 
ples of clusters at higher redshifts, and differences in the as¬ 
sumed cosmological model. The measured evolution is larger 
in a ACDM cosmology than in the Einstein de-Sitter mod¬ 
els which were assumed in many of the earlier studies {e.g. 
Arnaud et al. 2002; Lumb et al. 2004). 

Here we present an analysis of a sample of 11 clusters 
in the redsliift range 0.6 < 2 < 1.0 drawn from the Wide 
Angle ROSAT Pointed Survey (WARPS: Scharf et al. 1997; 
Perlman et al. 2002). A statistically complete, flux-limited 
(Ex(0.5 — 2 keV) > 6.4 x 1CP13 erg s^ 1 cm -2 ) sample of 
13 WARPS clusters was originally observed with Chandra 
and/or XMM-Newton, but two of the XMM-Newton ob¬ 
servations were rendered unusable due to extremely high 
background levels, leaving the 11 clusters discussed here. Of 
these clusters, two now fall below the flux limit due to point 
source contamination in the original ROSAT observations, 
however the set of 11 clusters used in this work should be 
fairly unbiased. These clusters’ properties, and the scaling 
relations derived from them, are compared to those of other 
samples at high and low redshift, and with the predictions 
of different cluster-formation models. 

A ACDM cosmology of Ho = 70 km s -1 Mpc -1 = 
100/iTO km s -1 Mpc -1 , and 11 m = 0.3 (Ha = 0.7) is adopted 
throughout, with the convention that Hm represents the 
present-day matter density, while 12 m («) represents its value 
at redshift 2 . All errors are quoted at the 68% level. 


2 DATA ANALYSIS 

The standard data reduction steps were followed for both the 
Chandra and XMM-Newton observations, and are discussed 
in detail in Maughan et al. (2003) ( Chandra ) and Maughan 
et al. (2004) ( XMM-Newton ). Data reduction was performed 
with Ciao 3.2.2 and SAS 6.1 for Chandra and XMM-Newton 
respectively. In summary, the data were filtered to remove 
high-background periods, and a surface-brightness profile 
was extracted (with point sources excluded) to determine 
the extent of the cluster emission. The detection radius (ra) 
of the cluster was then defined as the radius outside which no 
further emission was detected at the 3<r level in the surface- 
brightness profile. Spectra were extracted from within the 
detection radius, and were fit in the 0.4 — 7 keV band with 
an absorbed MeKaL model (Kaastra & Mewe 1993). The 
low energy cutoff was chosen to minimise calibration uncer¬ 
tainties at low energies for both XMM-Newton and Chandra. 
Ignoring energies above 7 keV has little effect on Chandra 
data due to the low effective area at those energies, and 


avoids instrumental fluorescent lines at ~ 8 keV which vary 
spatially across the XMM-Newton PN detector. The absorb¬ 
ing column was frozen at the Galactic value determined from 
21 cm radio observations (Dickey & Lockman 1990) during 
the spectral fits. When the spectra were of sufficient quality, 
the absorbing column was also fit, and found to be consistent 
with the Galactic value. The surface-brightness distribution 
of each system was modeled with a two-dimensional /3-model 
(Cavaliere & Fusco-Femiano 1976). 

The issue of background subtraction was carefully con¬ 
sidered when modeling both the spectra and the surface- 
brightness distributions. In all cases the background was 
measured from the same observation in a region as close to 
the source as possible, while avoiding contaminating source 
emission. Due to the higher background levels in the XMM- 
Newton observations, the background in the XMM-Newton 
surface-brightness models included two components to ac¬ 
count for the flat and vignetted background components 
(Maughan et al. 2004). Consistency checks were performed 
for several clusters from the sample using backgrounds de¬ 
rived from different regions of the source datasets and from 
blank-sky datasets {e.g. Maughan et al. 2003; Maughan et al. 
2004). The derived properties were generally independent of 
the background used. Point sources were masked out from 
all source and background regions during the analyses. 

During the spectral-fitting process, the model redshifts 
were frozen at the values derived from optical spectroscopy, 
and best-fitting temperatures were found with the metal- 
licity (Z) fixed at O.3Z0. The metallicity parameter was 
then allowed to vary in addition to the model temperature 
and normalisation, and its best-fitting value was found. In 
all cases, the best-fitting model temperatures obtained with 
and without the metallicity free to vary agreed within la. As 
the metallicities were not always well constrained, the tem¬ 
peratures derived with metallicity fixed at 0.3 Zq are used 
throughout this work. The effective area of the instruments 
was taken into account in the spectral modeling by using 
ancillary response Hies (ARFs) generated for the cluster po¬ 
sitions, and weighted by the spatial distributions of source 
photons. 

For the purposes of spectral fitting, additional filtering 
was applied to the XMM-Newton data retaining only events 
with FLAG and PATTERN parameters equal to zero. These 
correspond to events detected in single pixels which were not 
close to CCD gaps. The energy calibration of these events 
is the most reliable. The loss of effective area to CCD gaps, 
bad pixels, and excluded point sources within the spectral 
extraction region was accounted for by correcting the nor¬ 
malisation of the ARFs. While the ARF files already account 
for these losses in principal, they do not take into account the 
surface brightness distribution of the source. We included 
this effect in our correction as follows. For each cluster, a 
background-subtracted radial profile of the source region 
was produced, excluding “dead regions” (CCD gaps, bad 
pixels and point sources). In each radial bin, the measured 
flux was used to predict the number of counts that would 
have been detected if there were no dead regions. These 
counts were summed over the source region, and the ARF 
normalisation was scaled by the ratio of the detected counts 
to the predicted total counts if there were no dead regions. 
This process was not required for the Chandra observations 
as the spectral extraction regions were unaffected by dead 
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regions. The spectral redistribution matrix files were gen¬ 
erated with SAS version 6.1 using the calibration database 
appropriate for that release, which included improved cali¬ 
bration of the PN response at low energies. 

All of the clusters in our sample, with the excep¬ 
tion of C1J0046.3+8530, were the target of their observa¬ 
tions and were located close to the optical axis. However 
C1J0046.3+8530 was observed serendipitously near the edge 
of the field of view in two XMM-Newton observations, which 
introduced some additional calibration issues due to the 
broader PSF. The analysis of this system is discussed in 
detail in Maughan et al. (2004). 

The slope (f3 ) and core radius (r c ) of the gas-density 
profile were derived from a two-dimensional elliptical (3- 
model fitted to an image of the cluster emission, including 
the effects of vignetting and the PSF. The ellipticity of the 
model was defined as e = (1 — 6/a), where a and b are the 
semi-major and -minor axes respectively. With the central 
gas density derived from the normalisation of the best-fitting 
spectral model ( e.g. Maughan et al. 2003), it was then pos¬ 
sible to derive the gas-mass profile assuming spherical sym¬ 
metry. The total mass profile was then derived under the 
assumptions of isothermality, hydrostatic equilibrium, and 
spherical symmetry. From this total mass profile, an over¬ 
density profile (with respect to the critical density p c (z) at 
the cluster’s redshift) was derived, enabling the measure¬ 
ment of overdensity radii, Ra- Here Ra refers to the radius 
within which the mean density is Ap c (z), and A is an over¬ 
density factor. We take A to be a function of redshift, as 
explained in §4, and denote radii defined in this way as e.g. 
R200 (z) where A = 200 at z = 0. 

2.1 Computation of errors 

The uncertainties on the derived properties of a cluster were 
obtained in the following way. Distributions of the derived 
properties were computed from 10,000 randomisations of the 
observed properties within their uncertainty distributions. 
The la confidence limits on each of the derived properties 
were then obtained from the ±34 percentiles about the best- 
fit value. This method treats the statistical uncertainties in 
extrapolating properties to different radii self-consistently, 
but does not account for any systematic uncertainties in ex¬ 
trapolating properties beyond the limits of the data. A sig¬ 
nificant source of systematic uncertainty is the assumption 
of isothermality. Data of sufficient quality to measure tem¬ 
perature profiles were only available for two clusters in the 
sample, and in those cases the assumption of isothermality 
was justified. The systematic effect that undetected temper¬ 
ature gradients could have on derived masses is discussed in 
§7 and §8. 

An additional consideration is the well known positive 
correlation between the (3 and r c surface-brightness profile 
parameters. Models with large (3 and r c are similar to those 
with small values of the parameters. The error treatment 
described above assumes that all errors are independent. 
The effect of this assumption was investigated in the follow¬ 
ing way: A simple cluster image was simulated by adding 
Poisson noise to a two-dimensional /3-profile image. A (3- 
model was then fit to this image, and a two-dimensional 
probability distribution of f3 and r c values was generated. 
Cluster properties were then derived for an assumed tem¬ 


perature and MeKaL normalisation, and the uncertainties 
were computed by sampling pairs of (3 and r c values from 
the two-dimensional probability distribution, thereby accu¬ 
rately reflecting their correlated errors. The uncertainties 
on all other parameters were assumed to be negligible. This 
process was repeated assuming uncorrelated (3 and r c errors, 
and also assuming negligible errors on r c . It was found that 
the latter two methods gave similar uncertainties on cluster 
properties, and these uncertainties were approximately twice 
as large as those derived using the true, correlated errors on 
f3 and r c . In fact, the uncertainties on the properties of the 
observed clusters are dominated by the measurement errors 
on the temperature, and the choice between these methods 
makes a negligible contribution to the error budget. With 
this in mind, the measurement errors on r c were ignored in 
the calculation of the errors on all cluster properties. 

An interesting consequence of this self-consistent treat¬ 
ment of errors is that the fractional uncertainty on -R 2500 is 
in general significantly larger than that on i? 2 oo- This is in 
spite of the fact that the data for all of the clusters extend 
beyond -R 2500 , but must be extrapolated to R200 (see Table 
1). The reason for this is that without including any system¬ 
atic uncertainties due to the extrapolation, the distribution 
of /3 values has a larger effect on the shape of the density pro¬ 
file at small radii than large (>> r c ) radii. Fig. 1 illustrates 
this point, showing a range of 50 overdensity profiles derived 
for a 5.5 keV cluster at z = 0.833 with r c = 250 kpc and val¬ 
ues of (3 randomly drawn from the distribution 0.67 ± 0.07. 
These uncertainties on f3 are the average fractional uncer¬ 
tainties on our sample, and so reflect the typical uncertainty 
in the shape of the overdensity profiles. In these simulations, 
the fractional error on -R 2500 is 0.11, while that on R200 is 
0.05. The self-consistent treatment of the uncertainties can 
thus result in larger fractional error on quantities derived 
within the extent of the data than those extrapolated to 
large radii beyond the data. While the measurement errors 
on kT have a large effect on the uncertainty in the normali¬ 
sation of the overdensity profiles, this source of uncertainty 
is not important for this comparison as it does not affect the 
shape of the profiles. 


3 THE SAMPLE 

In this section each cluster in the sample is discussed briefly, 
highlighting any unusual or interesting aspects of its analy¬ 
sis and properties. Images of each of the clusters with con¬ 
tours of the adaptively smoothed X-ray emission overlaid are 
shown in Fig. 2 and the observed properties of the sample 
are summarised in Table 1 in redshift order. The detection 
radii of each cluster as a fraction of I? 2500 (z) and R 200 (z) 
are also given in Table 1. The method used to define and 
measure these overdensity radii is described in §4 and their 
values are given in Table 2 for each cluster. 

C1J0046.3+8530. This cluster was observed serendipi¬ 
tously 11' off axis in two consecutive XMM-Newton obser¬ 
vations of the open star cluster NGC 188. These data are 
discussed in detail in Maughan et al. (2004). The cluster 
has a reasonably relaxed morphology, and its temperature 
profile and hardness-ratio map are consistent with isother¬ 
mality out to 70% of the virial radius, within the statistical 
limits of the data. 
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Figure 1 . 50 sample overdensity profiles generated for a distribution of 0 values. The horizontal lines mark overdensities of 200 and 
2500. The overdensity at R is defined as the ratio of the mean density within R to the critical density. 


C1J1342.9+2828. The XMM-Newton observation of this 
system shows a core region that is elongated in the East- 
West direction, with two possible X-ray peaks. This mor¬ 
phology is indicative of a late-stage merger. 

C1J1113.1 — 2615. The Chandra observation of this cluster, 
which is discussed in detail in Maughan et al. (2003), shows a 
reasonably relaxed, although slightly elliptical, morphology. 
C1J1103.6+3555. Observed with XMM-Newton, this clus¬ 
ter has a fairly disturbed X-ray morphology, suggesting that 
it may not have dynamically relaxed after a recent merger. It 
also appears to be surrounded by more extended low surface- 
brightness emission. 

C1J0152.7—1357. This spectacular system is probably an 
early-stage merger between two equally massive clusters 
C1J0152.7— 1357N and C1J0152.7—1357S and has been stud¬ 
ied in some detail (Ebeling et al. 2000; Maughan et al. 2003; 
Huo et al. 2004; Jee et al. 2005). The Chandra observa¬ 
tion used here suggests that both of the clusters are reason¬ 
ably relaxed, with C1J0152.7—1357N the more elliptical. A 
recent, deep XMM-Newton observation has detected some 
substructure in C1J0152.7—1357N, while C1J0152.7—1357S 
still appears relaxed (Maughan et. al. in preparation). 
C1J1559.1+6353. This cluster was observed by XMM- 
Newton. The data show an elliptical morphology (e = 0.35; 
see Table 1), suggesting that the system may not yet have 
fully relaxed back into hydrostatic equilibrium after its last 
merger event. A bright, variable point-source « T South of 


the cluster centroid led to an overestimate of its ROSAT 
flux in the WARPS (Horner et. al. in preparation). For this 
system, a very high, poorly constrained metallicity was mea¬ 
sured, with relatively poor constraints on temperature (see 
Table 1). This is likely to be due to some remaining contam¬ 
ination from the bright source caused by the large wings of 
the XMM-Newton point spread function, and to the fact 
that the spectrum had a lower signal-to-noise than most of 
the other systems discussed here. 

C1J1008.7+5342. The morphology of this cluster appears 
fairly relaxed in the XMM-Newton observation. However, 
the ellipticity of the best-fitting surface-brightness model is 
0.27, indicating that the system may be disturbed to some 
degree. 

C1J1226.9+3332. This system has been the target of both 
Chandra (Cagnoni et al. 2001) and XMM-Newton (Maughan 
et al. 2004) observations, and was found to be extremely 
hot, with a regular, relaxed morphology. Based on a 16 ks 
XMM-Newton observation Maughan et al. (2004) measured 
a temperature profile and hardness-ratio map which were 
consistent with the cluster being isothermal out to 45% of 
the virial radius. In addition, the temperature and lumi¬ 
nosity measured from the Chandra and XMM-Newton data 
were found to be in good agreement (Maughan et al. 2004). 
The properties derived from the XMM-Newton data are used 
throughout this work. A forthcoming deep XMM-Newton 
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Figure 2. Contours of X-ray emission overlaid on optical images of the WARPS high-redshift sample. The pass-bands and telescopes 
used to produce the optical images varied, with the I, R or Z band, and Keck-II, the 4.2m William-Herschel Telescope, Subaru or the 
University of Hawaii’s 2.2m telescope used. Contours are taken from images that were adaptively smoothed (using the asmooth algorithm 
of Ebeling et al. (2005)) such that all features are significant at the 99% level, and are logarithmically spaced, with the lowest contour a 
factor of 1.5 above the background. The X-ray observatory used, and other information about the clusters is given in Table 1. 


observation will allow this massive cluster to be studied in 
unprecedented detail. 

C1J1429.0+4241. This cluster was observed serendipi- 
tonsly during an XMM-Neuiton observation of the BL Lac 
H1426+428. Only the data from the MOS2 detector were 
useful for its study because the other detectors were in fast 
timing mode. The cluster’s morphology is fairly disturbed, 
particularly in the centre, with a possible second X-ray peak 
to the West. During the two-dimensional surface-brightness 
modeling of this system, it was not possible to constrain 
the (3 parameter. This is likely to be due to a combination 
of the system’s morphology, its compactness compared with 
the XMM-Newton PSF, and the relatively shallow imaging 
with the single MOS detector. The fit was thus performed 
with (3 fixed at the canonical value of 0.67, and the errors 


quoted on /3 hereafter are the mean fractional errors of the 
rest of the sample (9%). 

C1J1415.1+3612. At z = 1.03, the most distant cluster in 
the sample. The XMM-Newton data show the morphology to 
be relaxed. A deep (78 ks) Chandra observation (Ebeling et. 
al. in preparation) of this system confirms the relaxed mor¬ 
phology, and rules out significant unresolved point source 
contamination in the XMM-Newton data used here. 

The measurement of reliable masses for these clus¬ 
ters based on the X-ray data requires that they be in 
hydrostatic equilibrium. Five systems (C1J1342.9+2828, 
C1J1103.6+3555, C1J0152.7-1357N, C1J1559.1+6353 and 
C1J1429.0+4241) are possibly unrelaxed showing disturbed 
morphologies, possible substructure and/or large elliptici- 
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Table 1. Summary of the observed properties of the WARPS high-redshift sample. Column 3 indicates whether the observation used here 
was made with Chandra (C) or XMM-Newton (X). Column 4 gives the on-axis exposure time remaining after removal of high-background 
periods (the mean of the MOS and PN times is given for XMM-Newton observations). Detailed analyses of several of the clusters can be 
found in “Maughan et al. (2004), ^Maughan et al. (2003) and c Maughan et al. (2004). ' The detection radius is given in units of -R2500(z) 
and f?200(z) f° r each cluster. These radii are defined in §4 and given in Table 2. The value of 0 for this cluster was fixed at 0.67 during 
the surface-brightness fitting, and the error quoted here is the mean fractional error on 0 in the rest of the sample. 


ties. The mass estimates for these systems are thus likely to 
be less reliable than those for the more relaxed systems. 

3.1 Comparison of Chandra and XMM-Newton 
Temperatures 

In compiling a sample which includes observations per¬ 
formed with two different instruments, the accuracy of their 
cross-calibration is an important consideration. In partic¬ 
ular, for the study of scaling relations, it is important 
that no systematic bias in the measured temperature is 
present due to the imperfect calibration of either instru¬ 
ment. For 4 of the clusters in the sample (C1J0152.7—1357N, 
C1J0152.7—1357S, C1J1226.9+3332 and C1J1415.1+3612) 
observation made with both Chandra and XMM-Newton 
were available to us. This enabled us to measure tempera¬ 
tures in a consistent way with both observatories. The mea¬ 
sured temperatures are plotted in Fig. 3. 

While limited by the small number of comparison points 
and the size of the statistical uncertainties on the tem¬ 
peratures, Fig. 3 suggests no gross systematic disagree¬ 
ment between the Chandra and XMM-Newton tempera¬ 
tures. This evidence, along with the good agreement between 
the WARPS L — T relation and the Chandra L — T relation 
of Vikhlinin et al. (2002) (discussed in detail in §6.1) indicate 
that the cross calibration of the two satellites is not a signifi¬ 
cant problem in this work. In studies of nearby clusters with 
high quality observations, some systematic temperature dis¬ 
agreements between Chandra and XMM-Newtonh&ve been 
found ( e.g. Sanderson et al. 2004; Vikhlinin et al. 2005). 
Given the evidence above, it appears that any such effect is 
small compared to the statistical uncertainties in this work. 


4 CLUSTER SCALING THEORY 

When comparing integrated cluster properties such as lumi¬ 
nosity and mass with theoretical predictions, simulations, or 
other work, the radius within which the properties are mea¬ 
sured is of great importance. It is common to use a fixed 
physical size as an outer radius, which has obvious benefits 



Figure 3. Comparison of temperatures measured with Chandra 
and XMM-Newton for the four clusters in our sample observed 
by both. 


in terms of simplicity. A more sophisticated method is to 
define a radius with some knowledge of the cluster’s mass 
profile so that the mean enclosed density is a fixed factor 
above the critical density of the universe. This method is 
more appropriate for comparisons with theoretical predic¬ 
tions, which predict the outer boundary of the virialised part 
of clusters in terms of a density contrast (A v ). For instance, 
in an Einstein-de Sitter universe, the mean density of viri¬ 
alised systems is A v = 187r 2 p c at all redshifts {e.g. Bryan 
& Norman 1998). However, the value of A v and its varia¬ 
tion with redsliift are cosmology dependent (as, of course, 
is p c (z)). Bryan & Norman (1998) calculate the redsliift de¬ 
pendence of A v in a ACDM cosmology for clusters which 
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have just virialised, fitting the solution with the expression 

A v (» = 187r 2 + 82(On(z)-l)-39(n m (2)-l) 2 , (1) 

where fl m ( 2 ) = Om(1 + z) 3 / E(z ) 2 , and E(z) describes the 
redshift evolution of the Hubble parameter, given by 

E ( 2 ) = f2iv[(l + z) 3 + (1 — 11 m — 11a)( 1 + z) + • (2) 

Equation f is accurate to within 1% in the range 0.1 < 
Q(z) < 1 (Bryan & Norman 1998), which corresponds to all 
2 for S2 m = 0.3. 

The ideal method to follow would therefore be to mea¬ 
sure cluster properties within radii corresponding to A v (z) 
at all redshifts. This is not possible because X-ray measure¬ 
ments typically only extend to a fraction of the radius cor¬ 
responding to A v (z), even at low redshifts, requiring the 
extrapolation of measured properties by large factors. An 
alternative to this extrapolation is to work at a smaller ra¬ 
dius, corresponding to a higher density contrast, A(0), at 
2 = 0. In the self-similar model, this contrast will scale with 
redshift according to 

a « = A(0 >£®- m 

In our analysis, all cluster properties are extrapolated to 
several different radii to enable comparison with other work. 
We note that for comparison with self-similar models, prop¬ 
erties measured within a redshift-dependent density contrast 
are the most appropriate. In order to simplify the notation, 
integrated quantities with a numerical subscript, say Ma(z), 
refer to that quantity within a radius (-Ra(z)) enclosing a 
mean density of A( 2 )p c ( 2 ). For example, M 2 oo(z) refers to 
the mass within a radius enclosing a mean overdensity of 200 
at 2 = 0, and some higher mean overdensity (given by Equa¬ 
tion 3) at higher redshifts. Note that this use of a redshift- 
dependent density contrast to define radii differs from the 
methods used in our previously published studies of some 
of these clusters. In Maughan et al. (2004) and Maughan 
et al. (2004), fixed (redshift-independent) density contrasts 
were used, while in Maughan et al. (2003) the virial radii 
were estimated from the temperatures of the clusters. The 
values of integrated cluster properties quoted in this paper 
thus differ from those given in the previous studies. 

Under the self-similar model, with the assumptions that 
clusters are spherically symmetric systems and that they 
virialised at the redshift of observation (late formation), sim¬ 
ple scaling relations between cluster properties can be de¬ 
rived, based on the virial theorem ( e.g. Bryan & Norman 
1998). The total gravitating mass within a radius R A ( Z ) is 
related to the gas temperature by 

M a(z) E{z)A{z) 1/2 x kT 3/2 (4) 

and the gas mass within R A (z) is given by the relation, 
M 9 a W E(z)A( 2) 1/2 (X fcT 3/2 / gas . (5) 

If the relative distribution of gas and dark matter does 
not change with redshift or temperature, i.e. the gas-mass 
fraction / gas is independent of 2 and kT, then Equation 5 be¬ 
comes similar to Equation 4. This assumption is supported 
by measurements of / gas in high-redshift clusters that are 
consistent with those in local clusters {e.g. Allen et al. 2002; 
Maughan et al. 2004). However, it has been found that / gas 
is lower in clusters with temperatures below ~ 3 — 4 keV 


(Sanderson et al. 2003, hereafter S03), which may invalidate 
the assumption of invariant / gas for those systems. 

Under the additional assumption that clusters emission 
is dominated by bremsstrahlung in the X-ray band (a rea¬ 
sonable assumption at the temperatures considered here), 
the X-ray luminosity within R A (z) is given by 

Aa ( z)E( 2)- 1 A(2)- 1/2 x fcr 2 / g 2 as . (6) 

The scaling between total mass and X-ray luminosity 
(M — L relation) is then given by combining Equations 4 
and 6, yielding 

L a(z) E(z)- 7/3 A(z)- 7/6 x s . (7) 


5 SUMMARY OF CLUSTER PROPERTIES 
AND LINE-FITTING METHODS 


The methods used to derive the properties of the clusters 
in the sample can be summarised as follows. Overdensity 
radii R A ( Z ) were measured as described in §2. The luminosi¬ 
ties measured within ra were extrapolated to R A ( Z ) using 
the best-fitting surface-brightness profiles. The fractional 
detection radius rd/f? 200 (z) varied from 0.3 to 0.9, while 
I'd/R 25 qo(z) was greater than unity for all clusters (see Table 
1). The luminosities were thus scaled from ra to f? 2 oo(z) by 
factors in the range 1.0 — 1.5 (median= 1.1). 

Similarly, the gas masses were derived at different -Ra(z) 
by extrapolation of the gas-mass profiles, obtained from 
the surface-brightness profiles. The extrapolation factors of 
M gas from ra to _R 2 oo(z) were in the range 1.1 — 5.3 with a 
median of 1.9. The total-mass profiles were used to derive 
the total masses at different overdensity radii, being scaled 
up by a factor in the range 1.1 — 3.8 (median= 1.7) from ra 
tO f?200(z)- 

For C1J0046.3+8530 and C1J1226.9+3332, temperature 
profiles and hardness-ratio maps support the isothermal as¬ 
sumption to at least 0.6f?2oo(z) ■ The effect of possible depar¬ 
tures from isothermality is investigated below. 

To enable comparisons with other work, the integrated 
properties of the WARPS sample are summarised in Table 
2 , derived within different radii. 

The observed properties of the WARPS sample, and 
others, are compared with the predicted scaling relations in 
the following sections. The best-fitting scaling relations to 
the observations were found by performing an orthogonal, 
weighted “BCES” regression (as described by Akritas & Ber- 
shady 1996), on the data in log space. This method takes into 
account measurement errors on both variables, correlations 
in those errors, and intrinsic scatter in the data. The use of 
orthogonal regression avoids the biases inherent in bisector- 
regression fits to data with intrinsic scatter. The errors on 
the slopes and normalisations of the relations were derived 
from jackknife analyses of the datasets. The y 2 goodness of 
fit values of the various relations were computed including 
the errors in both the x and y directions. In a similar way, 
the intrinsic scatter (cr s ) of the data about the relations was 
defined in log space as 


1 

N 


E 


{yi - mxi - c) 2 


2 1 1/2 


v 2 yi+™ 2 ali J 


( 8 ) 


where m and c are the gradient and intercept of the relation, 
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Cluster 

z 

kT 

-^200(z) 

^200(z) 

Alg 200(2) 

+1+200(2) 

^2500(z) 

^2500 (z) 

++ 2500(2) 

+1+2500(2) 



keV 

Mpc 

10 44 erg s -1 

10 13 Mq 

10 14 M© 

Mpc 

10 44 erg s _1 

10 12 M© 

1O 13 M 0 

Cl J0046.3+8530 

0.62 

4 - 4 /?/ 

0.96/°;°? 

4 11+0.23 

3.69/?;?? 

9 oi +0.59 
^•ol— o 56 

0 24 +0 ' 02 
u - Z4 -0.03 

2 01 + 0,35 
z - ul -0.29 

3 74+ 0 • 86 

0.' ^_o.83 

5.29/1;?? 

C1J1342.9+2828+ 

0.71 

q 7+0.5 
-0.4 

0.88/?;?? 

q ro+0.14 
o.oz_ 0 lg 

3.08/?;?? 

2 46“*" 0,59 
Z -^°-0.52 

n i q+o .02 

U.J-O— 0.03 

1 50"*"°' 36 

l.OU-0 34 

9 97+O.83 
- 0.76 

2.86/1;?? 

C1J1113.1—2615 

0.72 

4.7/°;? 

0.97/?;?? 

Q oc+0.32 

O.OO—o 33 

9 Q9 + 0.35 
z -^ z - 0.33 

3 3Q+ 1-02 

O.OU-o 78 

0 26 -1 " 0 ' 02 
U.^O_0.03 

2 fiq+°- 26 
Z.DU-o 24 

4 75+ 0 - 94 

^•<0—0.76 

7-75/?/? 

C1J1103.6+3555+ 

0.78 

6.0/°;? 

0.98/?;?? 

4.98/?;?? 

4.06/?/? 

q 09+0.88 
o.o^_Q 73 

0 24 +0 ' 02 
u ' z ^-0.02 

2 30+ 0 - 31 

Z.OU-Q 3Q 

q Q9+0.89 

°- yz -o 76 

7 99+2.09 

'• zz - 1.87 

C1J0152-7—1357N+ 

0.83 

c R+ 10 
°-°-0.8 

i oo +010 

l.UU_ 0 . 12 

ii-5o/l;|? 

6-48/?+ 

4 32+ 1,54 
^• oz -1.34 

0 1 c:+ 0 - 05 

U. TO—o.08 

2 62" 1 " 1 ' 58 
z - oz -1.85 

1.77/?/? 

1.90/?/? 

C1J0152-7— 1357S 

0.83 

^•°-i.o 

0.90/?/? 

6 72+ 1,07 
v - lz - 0.89 

3.84/?;?? 

3 14+ 1 - 22 

r» 99+O.O3 
u - zz - 0.04 

3 96" 1 " 0,57 
0 ,y0 —0.65 

4.86/;/? 

6.16/?/? 

C1J1559.1+6353+ 

0.85 

4 1+ 1 * 4 

0.78/?/? 

2.59/?;?? 

1 QO + 0 - 51 

3 g 

9 i 9+1.40 
z - lz - 0.84 

0 21 + 0 04 

i fiS+ 9 ' 39 
l.DO-0 46 

2-86/?/? 

5.28/?;?? 

C1J1008.7+5342 

0.87 

o c+0.8 
0.6 

0 76+ 0 - 08 
u - '°-0.09 

3-84/?;?? 

2 74+ 0 - 47 
-0.32 

2.00/?;?? 

n 14+0. 94 
U - 14 -0.05 

i-i6/?:?? 

1 9q+1.15 

J-.^O-Q 77 

1 1-7+1.56 
1 -0'— 1.06 

C1J1226.9+3332 

0.89 

10.6/1/ 

1 OQ +006 
i - zy _0.08 

43 7f)+°- 96 

40 . mj_o 96 

11 qn+0.89 

ii.yu_o 82 

10.20/1;?? 

0 35+ 0 - 02 

U.OO-0 03 

32.70/;/? 

21.50/?;?? 

25.00/?;?? 

C1J1429.0+4241+ 

0.92 

6 2 +1 ' 5 
oz -1.0 

0 97+ 011 
u,y '-0.11 

9.59/?;?? 

4 29" 1 " 0 ' 76 
^ ,z,y -0.60 

4 49+ 1 - 73 
^•^ y - 1.29 

0 26" 1 " 0 ' 03 
u ' zo -0.04 

6 92" 1 " 0,70 
o.y^-o 76 

7.28/1/? 

10.50/?;?? 

C1J1415.1+3612 

1.03 

r 7+1.2 

O QQ+ 9,9 ® 
U-OO —0 08 

10-40/?;?? 

q oc+0.54 

o.OO-o 43 

q oq+1.20 
O.OO—o. 94 

r» 9q+0.02 

O.^O_0 03 

7 58" 1 " 0 ' 70 

I.OO-0 81 

6.73/1/? 

0 09+3.12 
0.0^-245 


Table 2. Summary of the integrated properties of the WARPS high-redshift sample derived within different radii. Luminosities are 
bolometric X-ray luminosities, + These systems show evidence of being unrelaxed 


and (xi, yi) are the coordinates of each of the N data points, 
with uncertainties [a x i,a y i). This measurement was used to 
compare the scatter in the different relations discussed here. 


6 THE L - T RELATION 


The effect of cool cores needs to be accounted for in any 
measurement of the L — T relation. Many relaxed clusters 
have dense cooling cores, with gas temperatures in the cen¬ 
tral ~ 100 kpc falling to ~ 1/3 of the global temperature, 
and sharply peaked surface-brightness profiles ( e.g. Fabian 
1994; Kaastra et al. 2001). These effects are not included in 
the self-similar model, so must be taken into account in the 
analysis. This is routinely done by excluding the central re¬ 
gion of such clusters from the analyses. None of the clusters 
in our sample showed significant evidence for cooling cores; 
where temperature profiles and liardness-ratio maps could 
be created (C1J0046.3+8530 and C1J1226.9+3332) there was 
no indication of cooler gas in the central regions, and none 
of the surface-brightness profiles were centrally peaked. For 
this reason, no cooling-core correction was applied to the 
luminosities or temperatures of the high-redshift clusters. 

If the L — T relation evolves as predicted by Equation 
6 , then the effect of evolution can be removed by dividing 
the luminosity of each cluster by E(a)^ 1 (A(2:)/A(0)) _1 ^ 2 , 
which reduces to E(z)^ 1 (A v (z) / A v (0))~ ll/2 . Following this 
scaling, the high-z clusters should lie on the local relation. Of 
the two factors describing the evolution, E(z) dominates; at 
z = 1, E(z)- 1 = 0.57 and (A v («)/A v (0))“ 1/2 = 0.80. Fig. 4 
shows the scaled luminosities (those with self-similar evolu¬ 
tion factored out) measured within i? 200 (z) plotted against 
temperature. A relation of the form 


E(z)- 1 


Av(a) \ 

Av(0 )J 


- 1/2 

L A(z) 


A 


kT y 
6 keV ) 


(9) 


was fit to the data, resulting in best-fit values A = (5.14 ± 
0.84) x 10 44 /+ o 2 er 8 s _1 , B = 2.78 ± 0.55. The dot-dashed 
line in Fig. 4 shows the best-fitting relation with the ob¬ 
served luminosities, illustrating the effect of the predicted 
evolution. 

Also plotted on Fig. 4 is the local L — T relation mea¬ 
sured by Arnaud & Evrard (1999, hereafter AE99) for a sam¬ 
ple of clusters with little or no central cooling. It is unclear 


within what precise radius the AE99 luminosities were de¬ 
termined, however as they are referred to as “total luminosi¬ 
ties” we assume they are derived within _R 2 oo(z) ■ If the AE99 
luminosities were extrapolated to some larger radius (or to 
infinity), then the luminosities will be <10% higher than 
the values within 1 / 200 ( 2 )• The best fit to our scaled high- 
redshift data is consistent, within the errors, with the pa¬ 
rameters for the local relation of A = (5.86+0.40) x 10 44 /+ 0 2 , 
B = 2.88 ± 0.15. The local AE99 relation provides an ac¬ 
ceptable fit to the scaled high-redshift data (x 2 /^ = 10.2/9), 
while the unsealed high-redshift data rule out the local re¬ 
lation at > 95% level(x 2 /i / = 18.9/9). 

The scaled WARPS high-z L — T relation is also com¬ 
pared with the local relation measured by Markevitch (1998) 
in Fig. 4. In contrast with AE99, Markevitch (1998) cor¬ 
rected for central cooling by excising the central regions of 
the clusters and extrapolating a surface-brightness profile 
over this region. The luminosities measured by Markevitch 
(1998) were extrapolated to a fixed radius of 1.4/i/g 1 Mpc, 
so are expected to be « 2% higher than the values de¬ 
rived within i? 200 (z) (based on the typical scaling of lumi¬ 
nosity from 1.4 /i/q 1 Mpc to _R 2 oo(z) f° r our sample). The 
best-fitting relation found by Markevitch (1998) has A = 
(6.35 + 0.55) x 10 44 /i/ o 2 and B = 2.64 + 0.27 in our notation. 
While the WARPS data are also reasonably consistent with 
this local relation (x 2 ]v = 14.2/9), we prefer to compare the 
results with AE99 because the treatment of central cooling 
is consistent. 


6.1 Comparison with Vikhlinin et al. (2002) 

The results for the WARPS sample was compared with those 
obtained by Vikhlinin et al. (2002, hereafter V02) for a sam¬ 
ple consisting of 22 clusters at z > 0.4 observed with Chan¬ 
dra. In the V02 sample, luminosities were extrapolated to 
a fixed radius of lAh^f Mpc irrespective of their redshift. 
In order to compare the V02 luminosities with self-similar 
predictions, they were scaled to i? 200 (z) • The value of i? 200 (z) 
was computed in an identical way to the WARPS clusters 
(see §2) using the values of z, kT, r c and j3 given by V02. 
Random realisations of the overdensity profiles were com¬ 
puted from the uncertainties on z, kT, 0, given by V02. The 
luminosity of each system was then scaled from lAhjf Mpc 
to R 200 (z) based on /3-profiles with the parameters given by 
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Figure 4. L — T relation for the high-redshift WARPS sample. Luminosities were extrapolated to a radius R‘ 200 (z); corresponding to a 
redshift-dependent density contrast 200, and scaled by a factor of E(z)~ 1 [A(z)/A(0)]~ 1,/2 as predicted by the self-similar model. The 
solid line is the best fit to the data, and the dotted and dashed lines are the local relations of AE99 and Markevitch (1998) respectively. 
The dot-dashed line is the best fit to the unsealed high-redshift clusters (points not plotted) and can be used to judge the significance 
of the self-similar scaling. 


V02 for that cluster. The resulting scale factors for the lumi¬ 
nosities were close to unity, ranging from 0.89 to 1.02 with 
a mean of 0.98. 

As no errors on Lx are given by V02, the mean frac¬ 
tional uncertainty from the WARPS sample of 0.09 was as¬ 
sumed for the V02 clusters. The errors on the luminosities 
scaled to _R 2 oo(z) thus include the statistical uncertainties 
in modeling the overdensity profiles, and an additional 9% 
uncertainty. 

V02 excluded the central 71/r/o 1 kpc of clusters which 
had peaked surface-brightness profiles, and extrapolated 
over that region to correct for central cooling effects. As 
no cooling corrections were applied to the WARPS clusters, 
the corrected clusters were discarded from the V02 sample 
(this is a fairly conservative move as the corrected clusters do 
not scatter from the V02 L — T relation). We also excluded 
C1J0152.7—1357, which is already in the WARPS sample, 
leaving 13 V02 clusters. The best-fitting L — T relation for 
those clusters was consistent with the WARPS L — T rela¬ 
tion shown in Fig. 4. The two samples were then combined 
to give a sample of 22 systems, with 16 at z > 0.6. The lumi¬ 
nosities were scaled as before to remove the predicted self¬ 
similar evolution, and the resulting L — T relation is shown 
in Fig. 5, along with the local AE99 relation. 

The best-fitting parameters for the joint sample are 
A = (6.24 ± 0.62) x 10 44 /© 0 2 and B = 3.32 ± 0.37. This 
is marginally consistent with the local AE99 relation, with 
X 2 /V = 33.3/22. However, this statistic does not take into 
account any intrinsic scatter in the data so the null hypoth¬ 


esis probability of 6% is an underestimate to some extent. 
We thus conclude that the scaled high-redshift data can¬ 
not exclude the local AE99 relation as an acceptable model. 
Comparison of the unsealed data for the combined WARPS 
and V02 samples (indicated by the dot-dashed line in Fig. 5) 
with the AE99 relation strongly rules out the local relation 
as a description of the high-redshift data {x 1 /v = 130/22). 

As the WARPS results are based mainly on XMM- 
Newton observations, a potential source of systematic error 
is that cooling cores in high-redshift systems could go un¬ 
detected due to the large PSF of XMM-Newton. However, 
the best-fitting relation for the combination of the WARPS 
sample and the V02 data (excluding the systems with cool¬ 
ing cores detected by Chandra) is a good description of both 
samples (Fig. 5). Both samples also show a similar amount 
of scatter about the best-fitting relation. This suggests that 
the XMM-Newton measurements were not contaminated by 
undetected cooling cores, and that the cross-calibration of 
the instruments is not a significant concern here. 


6.2 Comparison with Ettori et al. (2004) 

The WARPS sample was then compared with the larger, 
more recently compiled sample of Ettori et al. (2004, here¬ 
after E04) which comprises 28 clusters at z > 0.4 from the 
Chandra archive. In E04, luminosities were extrapolated to 
R ' 500 ( 2 ), where the prime indicates a slightly different defini¬ 
tion of A (z) used in that work. The density contrast in E04 
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Figure 5. L — T relation for the combined WARPS and V02 samples. The WARPS data are the same as in Fig. 4. The V02 data were 
treated in the same way, with Lx extrapolated to # 200 ( 2 ) an d scaled by E(z)~ 1 [A(2:)/A(0)] — 1 / 2 . The solid line is the best fit to the 
combined dataset. 


was defined as 

A'(z)=5 (10) 

so is related to our A (z) by 

A 'W - ^r A (4- (ii) 

In the case of an Einstein-de Sitter universe A \z) = A ( 2 ). 
In our assumed low-density ACDM cosmology however, 
A v (0)/187t 2 = 1.76 and so A \z) < A( 2 ), leading to higher 
measured luminosities within f? 5 oo( z )- This change in defi¬ 
nition thus introduces a redshift-independent change in all 
luminosities, but the predicted evolution of the L — T rela¬ 
tion is unaffected. For consistency with E04, the luminosi¬ 
ties of the WARPS clusters were extrapolated to f? 5 oo( 2 )- 
The resulting WARPS L — T relation is plotted along with 
the E04 relation in Fig. 6. The AE99 relation is also plotted 
for comparison, though it should be recalled that those lu¬ 
minosities were extrapolated to larger radii. In addition to 
any evolution, the normalisation of the AE99 relation should 
thus be ss 5% higher than the R' 50 o(z) relations (estimated 
from the extrapolation of a standard /3-profile from 3?5 0 o( z ) 
to 1 ^ 200 ( 2 )) • 

The best fit to the WARPS R'soo(z) L — T relation is 
consistent with the i? 200 ( 2 ) relation of Fig. 4, and the local 
AE99 relation (see Table 3). The best fit to the E04 relation, 
with A = (5.2 ± 1.1) x 10 44 /© 0 2 and B = 4.28 ± 0.60, is 
possibly steeper than the other L — T relations discussed 
here (at the ~ 2<r level). We note that the slope we measure 
for the E04 sample is steeper than that quoted in E04 (B = 
3.72 ±0.47). This is because we have scaled the luminosities 


by 1?(2) -1 (A v (2)/A v ( 0)) -1,/2 whereas E04 scaled by E(z)~ 1 
alone, and because we use an orthogonal BCES regression, 
while E04 used a bisector BCES regression. If we eliminate 
these two differences, we recover the same fit as E04. 

The scatter of the E04 data about the best-fit L — T 
relation is a s = 1.78 which is larger than that of the com¬ 
bined WARPS and V02 sample (a s = 0.85; c.f. Figs. 5 and 
6). One probably cause of this is that E04 made no correc¬ 
tion for cool cores, while 7 of the clusters were classed as 
having cool cores by V02. These cool core clusters are indi¬ 
cated in Fig. 6, and their exclusion reduces both the scatter 
(cr s = 1.25) and also the normalisation of the best fitting re¬ 
lation. In light of these differences, and the good agreement 
between the WARPS and V02 L — T relations, we prefer 
to use the combined WARPS and V02 L — T relation, with 
luminosities measured within .R2000) in our further discus¬ 
sions. The different L — T relations discussed in this section 
are summarised in Table 3. 


7 THE M T RELATION 

In the self-similar evolution scenario, assuming the late- 
formation approximation, the M — T relation is given by 
Equation 4. The evolution of the M — T relation was investi¬ 
gated by comparing the WARPS sample to the local sample 
of S03, which comprises 66 clusters with reliable masses de¬ 
rived from temperature and surface-brightness profiles. We 
chose S03 for this comparison because the overdensity radii 
and mass measurements were made in the same way as in 
this work. Importantly, S03 also derived total and gas masses 
for all of the clusters in their sample under the assumption 
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L — T relation 

A (10 44 /i 70 2 erg s 

!) B 

notes 

Luminosities extrapolated to T?200(z) 
AE99 5.86 ± 0.40 

Markevitch (1998) 6.35 + 0.55 

WARPS 5.14 + 0.84 

V02 & WARPS 6.24 ± 0.62 

2.88 + 0.15 
2.64 ± 0.27 
2.78 + 0.55 
3.32 + 0.37 

Local clusters with no strong central cooling (Fig. 4). 

Local relation, corrected for cooling cores (Fig. 4). 

0.6 < 2 < 1.0 clusters with no cooling cores (Fig. 4). 

Combined samples (0.4 < 2 : < 1.3), no cooling cores (Fig. 5). 

Luminosities extrapolated to 

E04 5.21 ± 1.05 

E04 (no cool cores) 3.92 ± 1.04 
WARPS (R' 500(z) ) 4.97 ± 0.80 

4.28 + 0.60 
4.21 ± 0.98 
2.80 ± 0.55 

Full E04 0.4 < z < 1.3 sample (Fig. 6). 

E04 0.4 < 2 : < 1.3 clusters with no cooling cores (Fig. 6). 
WARPS 0.6 < 2 : < 1.0 clusters with no cooling cores (Fig. 6). 

Table 3. Summary of 

the L — T relations 

discussed in 

§6. Luminosities of the high-redshift clusters were scaled 


E(z) 1 (Av(2)/A v (0 )) 1 / 2 to remove the predicted self-similar evolution. 


of isothermality. They found that assuming isothermality for 
their entire sample, including genuinely isothermal, and non- 
isothermal clusters lead to an average ~ 30% overestimate 
of the true mass (the mass estimated with full temperature 
profiles) within f? 200 - 

This is a key point in this analysis. In all but a few 
cases, departures from isothermality in the high-z clusters 
cannot be detected. In the cases where there is evidence 
in support of isothermality, the constraints are not strong. 
While some of the high-z systems may be genuinely isother¬ 
mal, others will surely not be. Assuming isothermality for 
the whole sample (while correct for some clusters) is likely 
to lead to an average systematic mass overestimate similar 
to that found by S03. In order to separate this systematic 
effect from any real evolution in the M — T relation, it is 
essential that isothermal masses be used for the low-redshift 
clusters. 


For consistency with the WARPS sample, we include 
only the 40 clusters from the S03 sample with kT > 3 keV. 
The slopes and normalisations of the relations we fit to the 
S03 data vary from those reported in S03 because of this 
temperature cut-off, and because we use a slightly different 
fitting algorithm than S03. 

A reliable masses measurement requires that the 
clusters be in hydrostatic equilibrium. The five pos¬ 
sibly unrelaxed high-redshift clusters (C1J1342.9+2828, 
C1J1103.6+3555, C1J0152.7-1357N, C1J1559.1+6353 and 
C1J1429.0+4241) were flagged in this analysis, and all of 
the M — T relations were fit with and without these systems. 
Excluding the unrelaxed systems had no significant effect on 
any of the best fit relations, and so they were retained to 
avoid biasing the sample. 

Masses were derived within -R 2500 +), which falls within 
the detection radius for all of the WARPS clusters, and 
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R 200 M’ which corresponds to the estimated virial radius 
used in many other studies. Similarly to the method used 
with the L — T relation, the predicted self-similar evolution 
was factored out of the high-redshift cluster masses by fit¬ 
ting a relation of the form 


E(z) 


Ay (Z) 

Ay(0) 


1/2 


M 


AM 


= A 


kT 
6 keV 


( 12 ) 


to the data. 

Fig. 7 shows the M 2 5 oo(z) — T relation for the S03 and 
WARPS samples. The masses for both samples were derived 
assuming isothermality. The temperatures of the S03 clus¬ 
ters are emission-weighted temperatures measured within 
0.37?2 oo(z) , which corresponds closely to -R 2500 M > and are ex¬ 
trapolated over any central cool gas (see S03). The WARPS 
temperatures are also emission weighted, and are measured 
within the clusters’ detection radii. These measurements are 
consistent, under the assumption of isothermality, with the 
S03 temperature measurements. Our best-fitting relation 
for the S03 data is given by A = (1.92 ± 0.10) x 10 14 /i7o 
and B = 1.89 ± 0.15, while the best-fit to the WARPS 
data is parameterised by A = (1.49 ± 0.23) x 10 14 /?7o and 
B = 2.01 ±0.26. 

The M 2 5 oo( z ) — T relation was also measured for the 
WARPS systems when their masses were not scaled by the 
predicted evolution, and is plotted as a dot-dashed line in 
Fig. 7. These unsealed masses provide strong evidence for 
evolution of the M 2 5 oo(z) — T relation; the \ 2 of the local re¬ 
lation to the unsealed high-redshift data is X 2 /^ = 37.3/9. 
The scaled high-redshift data agree well with the local 
isothermal relation, with x 2 l v = 4.4/9. 

The M 200 (z) — T relation is shown in Fig. 8 for the S03 
and WARPS samples. The temperatures of the S03 clus¬ 
ters are as above, but measured within -R 200 M’ while the 
WARPS temperatures are again measured within ra- The 
isothermal S03 masses were used and the best-fit param¬ 
eters are summarised in Table 4. Again, the scaled high- 
redshift data are consistent with the local isothermal re¬ 
lation (x 2 /^ = 2.9/9). The best-fitting relation for the 
WARPS data using masses not scaled by the predicted evo¬ 
lution is also plotted in Fig. 8 as a dot-dashed line. The 
unsealed high-redshift data are inconsistent with the local 
isothermal M 2 oo(z) — T relation (x 2 /^ = 38.8/9), providing 
evidence for evolution. 

To enable comparisons with other work, masses of the 
WARPS clusters were also derived within radii enclosing 
fixed, redshift-independent density contrasts of A (z) = 
A(0) = 2500 and A (z) = A(0) = 200. These data were 
then fit with a relation of the form 

"«•> = (1S) 


so no scaling for the predicted self-similar evolution was 
made. The best-fitting slopes and normalisations are given 
in Table 4. 


8 THE M g - T RELATION 

The comparison of the WARPS and S03 samples also en¬ 
abled the investigation of the evolution of the M g — T re¬ 
lation. The gas mass is less dependent on uncertainties in 


the cluster temperature structure than the total mass. This 
is because the gas density profile is obtained easily from 
the X-ray surface-brightness profile, and the gas luminosity 
depends strongly on density (as p 2 ) and weakly on temper¬ 
ature (as T 1 / 2 ). This makes the M g — T relation potentially 
a more reliable method of exploring cluster evolution. An 
incorrect assumption of isothermality would still introduce 
systematic effects because the definition of overdensity radii 
depends on the total mass profile. The self-similar M g — T 
evolution prediction (Equation 5) is also subject to addi¬ 
tional assumptions about the relative distributions of the 
gas and dark matter (§4). 

The M g — T relations of the S03 and WARPS samples 
were derived within -R 2500 M and ^200 M > and are plotted in 
Figs. 9 and 10 respectively. The predicted self-similar evolu¬ 
tion was again factored out of the high-redshift cluster gas 
masses, with a relation of the form 


E(z) 


Av(2)\ 

Av(0 )J 


1/2 

Mg AM 


-(jSv) 


(14) 


fit to the data. As with the M — T relation, the unrelaxed 
clusters in our sample were flagged in this analysis, and the 
results were found to be independent of their inclusion or 
exclusion. M g — T relations were fit to the S03 data, using 
their gas masses derived under the assumption of isother¬ 
mality. Table 5 summarises the parameters of the various 
best-fitting relations. 

Measured within -R 2500 M’ the scaled high-redshift and 
low-redshift M g — T relations are consistent, with x 2 /v = 
4.8/9. The scaled high-redshift M g — T relation within 
R 200 M I s a l so consistent with the local S03 relation ( x 2 /v = 
8.0/9). 

The M g — T relations for the WARPS clusters when the 
masses were not scaled by the predicted self-similar evolu¬ 
tion are plotted as dot-dashed lines in Figs. 9 and 10. Within 
both radii, the unsealed high-redshift data are strongly in¬ 
consistent with the local isothermal relations, demonstrating 
the evolution of the relations. 

V02 also investigated the evolution of the M g — T re¬ 
lation, measuring gas masses within an overdensity radius 
defined in terms of the average baryon density of the uni¬ 
verse. Although their measurements are not directly com¬ 
parable to ours for this reason, they find evidence for weak 
evolution (with respect to the local relation) in the M g — T 
relation that is qualitatively consistent with our results. 


9 THE M - L RELATION 

The scaling between total mass and X-ray luminosity in the 
high-redshift clusters was then investigated. The WARPS 
sample was compared with the low-redshift HIFLUGCS 
sample of Reiprich & Bohringer (2002) in order to mea¬ 
sure any evolution in the M — L relation. The masses and 
luminosities published in Reiprich & Bohringer (2002) were 
scaled to Ho = 70 km s” 1 Mpc~ 4 , and clusters with kT < 
3 keV were removed for consistency with the WARPS clus¬ 
ters, leaving 52. The HIFLUGCS sample includes systems 
regardless of their morphology, although any strong sub¬ 
structure was excluded for the mass and luminosity de¬ 
terminations, and all of the high-redshift clusters (relaxed 
and unrelaxed) are included in this comparison. In addition, 
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Figure 7. M 25 oo(z) — T relation for the low-z S03 clusters and the relaxed high-z WARPS systems. Masses were measured within 
i?2500(z) assuming isothermality and scaled by the evolution predicted by the self-similar model. The dot-dashed line is the best fit to 
the unsealed high-redshift clusters (points not plotted) and can be used to judge the significance of the self-similar scaling. 



Figure 8. M 2 oo(z) — T relation for the low-z S03 clusters and the relaxed high-z WARPS systems. Masses were measured within R 200 (z) 
assuming isothermality and scaled by the evolution predicted by the self-similar model. The dot-dashed line is the best fit to the unsealed 
high-redshift clusters (points not plotted) and can be used to judge the significance of the self-similar scaling. 
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M — T relation 

A (10 14 hjg M 0 ) 

B 

notes 



Masses within #2500(2 
WARPS M 2500(z) - T 
S03 M 2 5oo( z ) — T 

) 

1.49 ±0.23 

1.92 ±0.10 

2.01 ±0.26 

1.89 ±0.15 

0.6 < 2 : < 1.0 clusters, isothermal 
Local clusters, isothermal masses 

masses 
(Fig. 7) 

(Fig. 7). 

Masses within #200(2) 
WARPS M 200(z) - T 

S03 M 200 ( z ) — T 

8.17 ±0.92 

9.08 ± 0.45 

1.25 ±0.37 
2.02 ±0.15 

0.6 < 2 : < 1.0 clusters, isothermal 
Local clusters, isothermal masses 

masses 

(Fig. 8) 

(Fig. 8). 

Unsealed masses within redshift-independent density contrast 

WARPS M2500 — T 1.10 zb 0.19 1.40 =b 0.53 0.6 < 2 < 1.0 clusters, isothermal 

WARPS M200 — T 5.14 zb 0.61 1.16 =b 0.39 0.6 < 2 : < 1.0 clusters, isothermal 

masses. 

masses. 



Table 4. Summary of the M — T relations discussed in §7. Masses of the high-redshift clusters were scaled by E(z)(A v (z)/ A v (0)) ly/2 to 
remove the predicted self-similar evolution, with the exception of the bottom section of the table. 



kT (keV) 


Figure 9. The M g 2500 (z) — T relations of the low-z S03 clusters and relaxed, high-z WARPS systems. Masses were measured within 
# 2500 ( 2 ) assuming isothermality, and scaled by the evolution predicted by the self-similar model. The dot-dashed line is the best fit to 
the unsealed high-redshift clusters (points not plotted) and can be used to judge the significance of the self-similar scaling. 


Mg — T relation A (1O 13 /i 7O 1 M0) 

B 

notes 


Gas masses within # 2500 ( 2 ) 

WARPS Mg 2500 (z) — T 1.11 dz 0.19 

S03 M g 2500 (z) -T 1.34 =b 0.12 

2.22 ±0.31 

2.29 ±0.33 

0.6 < 2 : < 1.0 clusters, isothermal 
Local clusters, isothermal masses 

masses (Fig. 9). 
(Fig. 9). 

Gas masses within # 200 ( 2 ) 

WARPS Mg 200(z) — T 10.0 zb 1.4 

S03 M g 200(z) - T 8.80 zb 0.52 

1.80 ± 0.49 
2.01 ±0.26 

0.6 < 2 : < 1.0 clusters, isothermal 
Local clusters, isothermal masses 

masses (Fig. 10). 
(Fig. 10). 


Table 5. Summary of the M g — T relations discussed in §8. Masses of the high-redshift clusters were scaled by ^(^(Av^/AvfO)) 1 / 2 
to remove the predicted self-similar evolution. 
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Figure 10. The M g 200(z) — T relations of the low-z S03 clusters and relaxed, high-z WARPS systems. Masses were measured within 
ij 2 00 (z) assuming isothermality, and scaled by the evolution predicted by the self-similar model. The dot-dashed line is the best fit to 
the unsealed high-redshift clusters (points not plotted) and can be used to judge the significance of the self-similar scaling. 


the masses of the local systems were determined assuming 
isothermality, which enables a fair comparison with the high- 
redshift masses. Properties extrapolated to 7?20owere used, 
and a relation of the form 
-7/6 


E(z) 


-7/3 


A V (Z) 

A v (0) 


^ 200 (z) 


= A 


M. 


200 (z) 


5 x 


5) 


was fit to high- and low-redshift data separately. 

Fig. 11 shows the best-fitting M — L relations for the 
two samples, and the parameters of the different relations 
are summarised in Table 6. The dot-dashed line shows the 
best-fitting M — L relation when no scaling for the predicted 
evolution is applied. The disagreement between that un¬ 
sealed high-z relation and the local data is striking, even 
given the large scatter in the local data. However, when 
the high-redsliift data are scaled by the predicted evolution, 
the best-fitting relation (solid line) agrees very well with 
the local relation (dashed line), with \ 2 l v = 9.3/9. The 
slopes of the low- and high-z relations are consistent, and 
are both steeper than the self-similar prediction of 4/3 (al¬ 
though not very significantly for the high-z data). At least 
some of the larger scatter in the HIFLUGCS data is due to 
cooling cores in some systems, as no correction for these was 
made (Reiprich & Bohringer 2002). 


10 DISCUSSION 

The normalisation of the WARPS high-redshift scaling re¬ 
lations are all consistent with self-similar evolution of the 
local relations in a ACDM cosmology. We emphasise the 
importance of properly quantifying the effect of assuming 


isothermality on the derived masses of high-redshift clus¬ 
ters when investigating the evolution of the M — T rela¬ 
tions. The L — T and M g 2 5 oo(z) — T relations are the most 
robust of the high-redshift relations measured here, sub¬ 
ject to the smallest systematic uncertainties and extrapo¬ 
lations. In particular the combined V02 and WARPS high- 
z sample enable the L — T relation to be measured with 
relatively small uncertainties. The M — L relation mean¬ 
while is predicted to show the strongest evolution (oc 
U(z)^ 7 ' /3 (A v (z)/A v (0)) _7 ' /6 ). The consistency of these re¬ 
lations with the predicted self-similar evolution provides 
strong evidence that that model is a good description of 
cluster evolution out to z « 1. 

A preliminary measurement of the high-redshift 
M 25 oo(z) — T relation in Maughan et al. (2003) found that it 
was consistent with no evolution, but did not strongly rule 
out self-similar evolution. This work improves on that ear¬ 
lier study in several ways. The sample used is larger, and 
the latest calibration was used in reanalysing the Chandra 
data (see §2). An important difference is that the masses are 
measured within radii corresponding to redshift-dependent 
density contrasts, estimated from the overdensity profile of 
each system. In the earlier study, the masses were mea¬ 
sured within a fraction of the virial radius estimated from 
each cluster’s temperature. The current study thus pro¬ 
vides a more reliable measurement of the evolution of the 
M 2 500 (z) - T relation. 

The slope of the combined WARPS and V02 L — T 
relation is consistent with its low-redshift counterparts, 
and steeper than the slope of 2 predicted by the self¬ 
similar model. This suggests that the same processes are 
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Figure 11. M — L relation for the low-z HIFLUGCS clusters and the high-z WARPS systems. Masses were measured within R 2 00(z) 
assuming isothermality and luminosities within the same radius were scaled by the evolution predicted by the self-similar model. The 
dot-dashed line is the best fit to the unsealed high-redshift clusters (points not plotted) and can be used to judge the significance of the 
self-similar scaling. 


M — L relation 

A (10 44 hj 0 2 erg s" 1 ) 

B 

notes 

Properties within 7? 2 00(z 
WARPS M- L 

Unsealed WARPS M - L 
HIFLUGCS M - L 

) 

2.76 ±0.81 

13.8 ±5.4 

3.40 ± 0.34 

1.90 ±0.49 
2.08 ± 0.62 
1.70 ±0.11 

0.6 < z < 1.0, luminosities scaled for predicted evolution (Fig. 11). 
0.6 < z < 1.0, no scaling for predicted evolution (Fig. 11). 

Local clusters assuming isothermality (Fig. 11). 


Table 6. Summary of the M — L relations discussed in §9. 


responsible for the steepening of both relations, and non- 
gravitational processes have already influenced cluster prop¬ 
erties by {z > 1). A possible interpretation of this is that 
the non-gravitational effects are important during the early 
stages of clusters’ lives, regardless of redshift, as the clusters 
discussed here are generally relaxed in appearance, suggest¬ 
ing the major part of their formation is complete. The high- 
redshift M — L relation also supports this interpretation. 

The slopes of M 20 o(z) — T relation above 3 keV in both 
the WARPS and S03 samples are consistent with the self¬ 
similar prediction of 3/2. When the cooler S03 systems are 
included, the slope steepens, which is consistent with non- 
gravitational effects having a larger relative contribution in 
low-mass systems. Observations of cooler (< 3 keV) clus¬ 
ters at high-redshift are required to test whether there is 
any evolution in this effect. There is also a weak trend for 
the M 2 5 oo(z) — T and M g 2 5 oo(z) — T relations to be steeper 
than the M 20 o( z ) — T and M g 2 oo(z) — T relations, whose 
slopes agree more closely with the self-similar slope. This 
trend is present, though not strongly significant, in the 
high- and low-redshift relations. These results are consistent 


with a scenario in which non-gravitational processes have a 
stronger effect in the central regions of clusters, which is 
more noticeable in cooler systems. 

Generally, the M — T relations show a self-similar slope 
above 3 keV, the M g — T relations are self-similar, or slightly 
steeper, and the L — T relations are steeper than self-similar 
prediction. This suggests that although non-gravitational 
processes do not have a strong influence on the dark mat¬ 
ter in > 3 keV clusters, those processes can still have an 
important effect on the gas in more massive systems. 


10.1 Evolution of the L — T relation 

As we saw in the §6, the evolution of the L — T relation is 
consistent with the self-similar model described in §4. The 
relatively large size of the combined WARPS and V02 sam¬ 
ples also enables alternative models of cluster evolution that 
are used in the literature to be tested. As discussed in section 
§4, it is popular to use a fixed, redshift-independent density 
contrast to define the outer radius of clusters. In this case, 
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the L — T relation is given by 

LaE(z)- 1 x kT B . (16) 

Now the A (z)~ 1,/2 part of the normalisation is no longer 
required, and as this is an increasing function of z, the pre¬ 
dicted L — T evolution is smaller than that predicted by the 
A (z) model. The measured luminosities are slightly larger 
when extrapolated to R-a instead of Ra(z), which acts in 
the opposite sense, increasing the apparent evolution. This 
evolution model was tested by comparing the combined 
WARPS and V02 L — T relation, with luminosities extrap¬ 
olated to R 200 (not R2oo(z)) and scaled by E(z )~ x , with 
the local AE99 relation. The E{z) _1 scaling was insufficient 
to reduce the luminosities to be consistent with the local 
L — T relation, with \ 2 l v = 51/22. This indicates that a 
redshift-dependent density contrast is required in the self¬ 
similar model to reconcile the high-redsliift and local L — T 
relations. 

A simple way of measuring the evolution of the L — T 
relation is to assume that it evolves as 

La x ( l + z) a kT B . (17) 

V02 found a = 1.5 ± 0.3 at 90% confidence by comparing 
their high-redshift data with the local Markevitch (1998) 
L — T relation. The combined WARPS and V02 L — T data 
were used to make a similar measurement. The luminosity of 
each cluster extrapolated to R 200 was divided by (1 + z) a , 
and the scaled data were compared with a local relation. 
This process was performed for a range of values of a, and 
for both of the local AE99 and Markevitch (1998) L — T 
relations and the results are shown in Fig. 12. 

We find the best agreement between the scaled high- 
redshift data and the local AE99 relation, with a = 1.3±0.2 
(at the 90% level) and = 23.0/22. Using the Marke¬ 
vitch (1998) relation as a low-redshift baseline results in a 
slightly lower value of a and a poorer fit. A measurement of 
a independent of that of V02 was also performed by using 
the WARPS sample alone. Compared with the AE99 local 
relation we find a = 0.8 ± 0.4 (90% level) while comparison 
with the Markevitch (1998) relation gives a = 0.7±0.4 (90% 
level; see Fig. 13. These measurements are marginally con¬ 
sistent with the values of a = 1.5 ±0.3 (68% level) found by 
Lumb et al. (2004) and 1.8 ±0.3 (68% level) found by Kotov 
& Vikhlinin (2005) in samples of high-redshift clusters ob¬ 
served with XMM-Newton. We thus conclude that a « 1.2 
and that the range of values found is at least partially due to 
the uncertainty in the exact form of the local L — T relation. 

While current the data cannot distinguish between 
a L x (1 ± z) a kT B model and the full self-similar 
LE(z)- 1 A( 2 ) -1 / 2 x kT B model for the evolution of the 
L — T relation, the latter is more satisfactory because it is 
physically motivated, and does not introduce an aditional 
free parameter (a). 

Recent theoretical work has attempted to include the 
effects of preheating and radiative cooling in the simple self¬ 
similar scaling relations. Voit (2005) shows that introducing 
a cooling threshold K c = T 2 ^ 3 t(z ) 2 ^ 3 , where gas with an 
entropy less than K c radiates all of its thermal energy away 
within a Hubble time t(z), produces a “cooling threshold” 
L — T relation of the form 

L x kT 25 E(z)~ 1 t(z)~ 1 . (18) 


Voit (2005) also investigates the effect that including a mod¬ 
est initial amount of entropy in the gas before it is accreted 
onto a forming cluster has on the L — T relation. If the ini¬ 
tial entropy level is chosen to match the observation that 
A'(0.1i?2oo) x kT 2 / 3 (Ponman et al. 2003), then the result¬ 
ing “altered similarity” L — T relation is 

L x kT 3 E(z)~ 3 t(z)~ 2 . (19) 

Both of these modified self-similar models predict 
steeper L — T slopes which are in better agreement with 
observations than the simple self-similar slope of 2. The evo¬ 
lution of the L — T relation, however, is much milder in the 
modified self-similar models than in the simple self-similar 
model. Fig. 14 shows the evolution of the normalisation of 
the L — T relation for a variety of models. The data points 
show the observed evolution in the combined WARPS and 
V02 sample relative to the AE99 and Markevitch (1998) lo¬ 
cal relations. The data are binned by redshift, and the data 
points show the weighted mean and weighted standard de¬ 
viation for each bin. While dividing the data into redshift 
bins may be pushing the limits of the current data, Fig. 14 
serves to illustrate the range of predictions for the evolution 
of the L — T relation. The overall conclusion on the evolu¬ 
tion of the L — T relation is that the self-similar model is 
a good description of the evolution out to 2 « 1. Larger 
samples at high redshifts will enable more detailed models 
to be tested, for example, indicating whether the evolution 
continues to increase with redshift, or flattens out in line 
with the predictions of the modified self-similar models. 


10.2 Continuous formation predictions 


The M — T relation defined in Equation 4 is derived under 
the late-formation approximation, in which a cluster forms 
in a single collapse, terminating with the system having just 
virialised at the redshift of observation. Several authors ( e.g. 
Lacey & Cole 1993; Voit 2000) have derived M — T relations 
in a more realistic continuous-formation scenario, in which 
clusters grow by accumulating much smaller virialised ob¬ 
jects. In a flat ACDM cosmology, Voit (2000) predicts 


kT = (8.0 keV) 


M 


10 15 /i _1 


.-innMfTj 


2/3 


Ut) 

£c(to) 


( 20 ) 


where £ c (t) is the specific energy of a shell of matter which 
collapses onto the cluster at a time t (see the appendix of 
Voit 2000 for details). I 11 this model, the evolution of the 
normalisation of the M — T relation is different from the 
late-formation predictions. 

The continuous-formation model predicts that clusters 
of a given mass are cooler than the late-formation model pre¬ 
dictions. However, the difference between the models is small 
at z « 1, requiring measurements of the M — T normalisa¬ 
tion to a precision of < 2% to distinguish between them, 
and our results are consistent with either of these cluster- 
formation scenarios. The agreement between our measure¬ 
ments of the evolution of the M — T and M g — T relations, 
and these models indicates that the properties of galaxy 
clusters reflect the properties of the universe at their redshift 
of observation. In the late-formation approximation this is 
simply because clusters formed at the redshift of observa¬ 
tion, while in the continuous-formation model, the accretion 
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Figure 12. The x 2 values for the comparison of the scaled high-redshift WARPS and V02 L — T data with the local relations. The 
luminosities of the high-redshift systems were scaled by (1 + z)~ a for different values of a. 



Figure 13. The x 2 values for the comparison of the scaled high-redshift WARPS L — T data alone with the local relations. The 
luminosities of the high-redshift systems were scaled by (1 + z)~ a for different values of a. 
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Figure 14. The evolution of the L — T relation normalisation predicted by different models. The data points are the observed evolution 
in the combined WARPS and V02 sample, in redshift bins, relative to different local L — T relation. 


of matter onto clusters continually realigns their properties 
with those of the evolving universe. 


10.3 Probing the redshift of virialisation 

If clusters form in a single collapse, then they may virialise 
at a redshift (z v ) which is larger than their redshift of ob¬ 
servation (z 0 bs). Their properties would then reflect those 
of the universe at an epoch earlier than z 0 bs, and the self¬ 
similar evolution which assumes z 0 bs = z v would not be a 
good description of the high-redshift scaling relations. If one 
is willing to assume that clusters obey self-similar evolution, 
then the high-redshift data can be used to place interesting 
constraints on the mean redshift of formation of the high- 
redshift sample (z7h). If the mean redshift of formation of 
the local sample is denoted as z©, then the ratio of the nor¬ 
malisation of the low- and high-redshift L — T relations is 
given by [E(z^h) A v (z7h) 1/2 ]/[F(zvi)A v (zvi) 1/a J ( c.f. Equa¬ 
tion 6). 

Under the assumptions given above, the ratio of the 
normalisations of the high- and low-redshift L — T relations 
can be used to relate zji and z7h- The high-redshift L — T re¬ 
lation thus allows z^h to be measured for an assumed z w i- A 
relation with a slope fixed at the local value of 2.88 (AE99) 
was fit to the unsealed WARPS and V02 liigh-z L — T data, 
and the best-fit normalisation, along with the maximum and 
minimum normalisations allowed by the data at the 99% 
level were found. These were then used to derive the con¬ 
straints on z^h plotted in Figure 15. 

If the local or high-redshift clusters formed at a red¬ 
shift other than z 0 bs, then the overdensity radii within which 
the luminosities are measured would be inappropriate. How¬ 


ever, as the overdensity radii are all large and the surface- 
brightness is low in outer regions of the clusters, the use of 
different radii has a negligible effect. To illustrate this, if we 
consider a 6 keV cluster with j3 = 0.67 and r c = 100 kpc 
observed at z = 0, we find i? 2 oo(z) = 2.0 Mpc for z v = 0 and 
R 2 oo(z) =1-6 Mpc for z v = 1. The luminosity of the cluster 
increases by just 1% between these two radii. 

Figure 15 shows that if the local clusters are assumed 
to have virialised at z = 0, then the high-redshift data are 
consistent with z v h = z 0 bs, with Zvh < 1-5 at the 99% level. 
We strongly rule out a common redshift of virialisation for 
the local and high-redshift samples, regardless of the redshift 
of formation of the local sample. That is not to say that 
the high-redshift clusters cannot evolve onto the local L — T 
relation, via mergers and accretion resetting their properties 
to new values of z v . These results are simply indicating that 
the local clusters did not form fully at the same high redshift 
as the distant clusters and then evolve passively. 


10.4 Properties of the WARPS clusters 

The observed gas properties of the WARPS high-redshift 
sample are generally consistent with those of local clusters. 
The mean surface-brightness profile slope, f3 = 0.66 ± 0.05 
is very close to the canonical value of 2/3 (Jones & Forman 
1984). Also, the mean ratio of core radius to “virial radius” 
r c /i? 200 (z) = 0.14±0.05 agrees well with the values found by 
Sanderson & Ponman (2003) for > 1 keV clusters. Further¬ 
more, the gas-mass fractions found in the high-redshift clus¬ 
ters agree with those found in local clusters. The weighted 
mean / gas at J? 2500 (z) is 0.069±0.012, increasing to 0.11±0.02 
at i? 200 (z) which agrees with the values found by S03. These 
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Figure 15. Constraints on the mean redshift of virialisation of the combined WARPS and V02 high-redshift sample derived from the 
evolution of the L — T relation. The dotted line indicates a common redshift of virialisation of the high- and low-redshift samples. 
Errorbars are 99%. 


results indicate that the gas distribution is the same in low- 
and high-redshift clusters. 

The metal abundances measured in the high-redshift 
systems are plotted against redshift in Fig. 16. The data 
(excluding C1J1559.1+6353 with its very high, poorly con¬ 
strained measurement) are consistent with the canonical 
value of 0.3Zq out to z = 1; the weighted mean of the 
values is 0.28 ± O.llZg. This is consistent with the high- 
redshift of enrichment of the intra-cluster medium found by 
other authors ( e.g. Mushotzky & Loewenstein 1997; Tozzi 
et al. 2003). 


11 CONCLUSIONS 

The overall picture provided by this study of the evolution 
of the cluster scaling relations is that within the statistical 
limits of the current data, the evolution of galaxy clusters 
out to 2 « 1 is described well by the self-similar model. The 
large-scale properties of clusters are dominated by the den¬ 
sity of the universe at the epoch at which they are observed. 
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